Revisiting the protein-coding gene catalog of Drosophila melanogaster using 12 fly genomes

Abstract

The availability of sequenced genomes from 12 Drosophila species has enabled the use of comparative genomics for the systematic discovery of functional elements conserved within this genus. We have developed quantitative metrics for the evolutionary signatures specific to protein-coding regions and applied them genome-wide, resulting in 1193 candidate new protein-coding exons in the D. melanogaster genome. We have reviewed these predictions by manual curation and validated a subset by directed cDNA screening and sequencing, revealing both new genes and new alternative splice forms of known genes. We also used these evolutionary signatures to evaluate existing gene annotations, resulting in the validation of 87% of genes lacking descriptive names and identifying 414 poorly conserved genes that are likely to be spurious predictions, noncoding, or species-specific genes. Furthermore, our methods suggest a variety of refinements to hundreds of existing gene models, such as modifications to translation start codons and exon splice boundaries. Finally, we performed directed genome-wide searches for unusual protein-coding structures, discovering 149 possible examples of stop codon readthrough, 125 new candidate ORFs of polycistronic mRNAs, and several candidate translational frameshifts. These results affect >10% of annotated fly genes and demonstrate the power of comparative genomics to enhance our understanding of genome organization, even in a model organism as intensively studied as Drosophila melanogaster.